12 results for multivariate discriminant analysis

in DigitalCommons@The Texas Medical Center


Relevance:

90.00%

Publisher:

Abstract:

The ascertainment and analysis of adverse reactions to investigational agents present a significant challenge because of the infrequency of these events, their subjective nature and the low priority of safety evaluations in many clinical trials. A one-year review of antibiotic trials published in medical journals demonstrates the lack of standards in identifying and reporting these potentially fatal conditions. This review also illustrates the low probability of observing and detecting rare events in typical clinical trials which include fewer than 300 subjects. Uniform standards for ascertainment and reporting, including operational definitions of study subjects, are suggested. Meta-analysis of selected antibiotic trials using multivariate regression analysis indicates that meaningful conclusions may be drawn from data from multiple studies that are pooled in a scientifically rigorous manner.
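The point about rare events in trials of fewer than 300 subjects can be made concrete with a short calculation. The sketch below assumes subjects are independent and that each has the same chance of the adverse reaction; the event rates are hypothetical and are not taken from the review.

```python
def prob_at_least_one(event_rate: float, n_subjects: int) -> float:
    """Probability of observing at least one event among n independent subjects."""
    return 1.0 - (1.0 - event_rate) ** n_subjects


# Hypothetical adverse-reaction rates, chosen only for illustration.
for rate in (1 / 100, 1 / 1000, 1 / 10000):
    p = prob_at_least_one(rate, 300)
    print(f"rate={rate:.4f}  P(at least one event in 300 subjects)={p:.3f}")
```

Under these assumptions, a reaction that occurs in 1 of 1,000 subjects would be seen at least once in only about one in four trials of 300 subjects.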

Relevance:

80.00%

Publisher:

Abstract:

This study compared three body measurements, height, hip width (bitrochanteric) and foot length, in 120 Hispanic women who had their first birth by cesarean section (N = 60) or by spontaneous vaginal delivery (N = 60). The objective of the study was to see if there were differences in these measurements that could be useful in predicting cephalopelvic disproportion. Data were collected from two public hospitals in Houston, Texas over a 10-month period from December 1994 to October 1995. The statistical technique used to evaluate the measures was discriminant analysis.

Women who delivered by cesarean section were older, shorter, had shorter feet and delivered heavier infants. There were no differences in the bitrochanteric widths of the women or in the mean gestational age or Apgar scores of the infants.

Significantly more of the mothers and infants were ill following cesarean section delivery. Maternal illness was usually infection; infant illness was primarily infection or respiratory difficulties.

Discriminant analysis is a technique that classifies and predicts the group to which a particular entity will belong, given a certain set of variables. Using discriminant analysis with the probability of cesarean section set at 50 percent, the best combination for classifying who would have a cesarean section was height and hip width, which correctly classified 74.2 percent of those who needed surgery. When the probability of cesarean section was 10 percent and the probability of vaginal delivery was 90 percent, the best predictor of who would need operative delivery was height, hip width and age, which correctly classified 56.2 percent. In the population from which the study participants were selected, the incidence of cephalopelvic disproportion was low, approximately 1 percent.

With the technologic assistance available in most of the developed world, it is likely that the further pursuit of different measures and their use would not be of much benefit in attempting to predict and diagnose disproportion. However, in areas of the world where much of obstetrics is "hands on", the availability of technology is extremely limited, and the incidence of disproportion is higher, the use of anthropometric measures might be of some potential benefit.
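The abstract above reports different classification results when the assumed probability of cesarean section changes from 50 percent to 10 percent. A minimal sketch of why the prior matters is given below for a single measurement and two groups. The group means, the shared variance and the function name `lda_classify` are made up for illustration; they are not the study's data or its fitted model.

```python
import math


def lda_classify(x, mean_a, mean_b, pooled_var, prior_a):
    """Return 'A' or 'B' for a single measurement x under two-class LDA.

    Assumes both groups share one variance (the LDA assumption) and uses
    the linear score  x*mean/var - mean^2/(2*var) + log(prior).
    """
    def score(mean, prior):
        return x * mean / pooled_var - mean ** 2 / (2 * pooled_var) + math.log(prior)

    return "A" if score(mean_a, prior_a) > score(mean_b, 1.0 - prior_a) else "B"


# Made-up numbers for illustration only (not from the study):
# group A = cesarean section, group B = vaginal delivery, x = height in cm.
MEAN_A, MEAN_B, VAR = 152.0, 158.0, 36.0

for prior in (0.5, 0.1):
    label = lda_classify(154.0, MEAN_A, MEAN_B, VAR, prior_a=prior)
    print(f"prior P(cesarean)={prior:.1f} -> height 154 cm classified as {label}")
```

With equal priors the cut point sits midway between the two group means; lowering the prior for group A moves the cut point toward A's mean, so fewer women are assigned to that group.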

Relevance:

80.00%

Publisher:

Abstract:

Recently it has been proposed that the evaluation of effects of pollutants on aquatic organisms can provide an early warning system of potential environmental and human health risks (NRC 1991). Unfortunately, there are few methods available to aquatic biologists to conduct assessments of the effects of pollutants on aquatic animal community health. The primary goal of this research was to develop and evaluate the feasibility of such a method. Specifically, the primary objective of this study was to develop a prototype rapid bioassessment technique similar to the Index of Biotic Integrity (IBI) for the upper Texas and Northwestern Gulf of Mexico coastal tributaries. The IBI consists of a series of "metrics" which describe specific attributes of the aquatic community. Each of these metrics is given a score, and the scores are then summed to derive a total assessment of the "health" of the aquatic community. This IBI procedure may provide an additional assessment tool for professionals in water quality management.

The experimental design consisted primarily of compiling previously collected data from monitoring conducted by the Texas Natural Resource Conservation Commission (TNRCC) at five bayous classified according to potential for anthropogenic impact and salinity regime. Standardized hydrological, chemical, and biological monitoring had been conducted in each of these watersheds. The identification and evaluation of candidate metrics for inclusion in the estuarine IBI was conducted through the use of correlation analysis, cluster analysis, stepwise and normal discriminant analysis, and evaluation of cumulative distribution frequencies. Scores of each included metric were determined based on exceedances of specific percentiles. Individual scores were summed and a total IBI score and rank for the community computed.

Results of these analyses yielded the proposed metrics and rankings listed in this report. Based on the results of this study, incorporation of an estuarine IBI method as a water quality assessment tool is warranted. Adopted metrics were correlated to seasonal trends and less so to salinity gradients observed during the study (0-25 ppt). Further refinement of this method is needed using a larger, more inclusive data set which includes additional habitat types, salinity ranges, and temporal variation.
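The scoring step described above (score each metric by percentile exceedance, then sum) can be sketched as follows. The 25th and 75th percentile cut points, the 1/3/5 scale, the assumption that higher values are better, and the metric names and values are all assumptions of this sketch; the abstract does not state which percentiles or score values the study used.

```python
from statistics import quantiles


def score_metric(value, reference_values):
    """Score one metric 1, 3 or 5 by where it falls among reference values.

    Cut points are the 25th and 75th percentiles of the reference data;
    both the cut points and the 1/3/5 scale are assumptions of this sketch.
    """
    q25, _, q75 = quantiles(reference_values, n=4)
    if value > q75:
        return 5
    if value > q25:
        return 3
    return 1


def ibi_total(sample, reference):
    """Sum the per-metric scores for one sample into a total IBI score."""
    return sum(score_metric(sample[m], reference[m]) for m in sample)


# Hypothetical metrics and values, for illustration only.
reference = {
    "species_richness": [4, 6, 8, 10, 12, 14, 16, 18],
    "total_abundance": [20, 40, 60, 80, 100, 120, 140, 160],
}
sample = {"species_richness": 17, "total_abundance": 70}
print(ibi_total(sample, reference))
```

A metric for which lower values indicate better condition would need its comparison reversed before being summed with the others.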

Relevance:

80.00%

Publisher:

Abstract:

The purpose of this study was to examine the relationship between enterotoxigenic Escherichia coli (ETEC) and travelers' diarrhea over a period of five years in Guadalajara, Mexico. Specifically, this study identified and characterized ETEC from travelers with diarrhea. The objectives were to study the colonization factor antigens, toxins and antibiotic sensitivity patterns in ETEC from 1992 to 1997 and to study the molecular epidemiology of ETEC by plasmid content and DNA restriction fragment patterns.

In this survey of travelers' diarrhea in Guadalajara, Mexico, 928 travelers with diarrhea were screened for enteric pathogens between 1992 and 1997. ETEC were isolated in 195 (19.9%) of the patients, representing the most frequent enteric pathogen identified.

A total of 31 antimicrobial susceptibility patterns were identified among ETEC isolates over the five-year period.

The 195 ETEC isolates contained two to six plasmids each, which ranged in size from 2.0 to 23 kbp.

Three different reproducible rRNA gene restriction patterns (ribotypes R-1 to R-3) were obtained among the 195 isolates with the enzyme HindIII.

Colonization factor antigens (CFAs) were identified in 99 (51%) of the 195 ETEC strains studied.

Cluster analysis of the observations from the four assays confirmed five distinct groups of study-year ETEC strains. Each group had a >95% similarity level among strains within the group and a <60% similarity level between groups. In addition, discriminant analysis of the assay variables used in predicting the ETEC strains revealed a >80% relationship between both the plasmid and rRNA content of ETEC strains and study-year.

These findings, based on laboratory observations of the differences in biochemical, antimicrobial susceptibility, plasmid and ribotype content, suggest complex epidemiology for ETEC strains in a population with travelers' diarrhea. The findings of this study may have implications for our understanding of the epidemiology, transmission, treatment, control and prevention of the disease. It has been suggested that an ETEC vaccine for humans should contain the most prevalent CFAs. Therefore, it is important to know the prevalence of these factors in ETEC in various geographical areas.

CFAs described in this dissertation may be used in different epidemiological studies in which the prevalence of CFAs and other properties of ETEC will be evaluated. Furthermore, in spite of an intense search among nearly 200 ETEC isolates for strains that may have a clonal relationship, we failed to identify such strains. However, further studies are in progress to construct suitable live vaccine strains and to introduce several CFAs into the same host organism by recombinant DNA techniques (Dr. Ann-Mari Svennerholm's lab). (Abstract shortened by UMI.)

Relevance:

80.00%

Publisher:

Abstract:

In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences of genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously raises the problem of multiple testing and will give false-positive results. Although this problem can be effectively dealt with through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of joint effects of several genes, each with a weak effect, may go undetected. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset among big data sets where the number of feature SNPs far exceeds the number of observations.

In this study, we take two steps to achieve the goal. First, we selected 1000 SNPs through an effective filter method, and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. We also developed a novel classification method, the sequential information bottleneck method, wrapped inside different search algorithms to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with classical linear discriminant analysis in terms of classification performance. Finally, we performed chi-square tests to look at the relationship between each SNP and disease from another point of view.

In general, our results show that filtering features using the harmonic mean of sensitivity and specificity (HMSS) through linear discriminant analysis (LDA) is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that exhaustive search of small subsets of one, two or three SNPs based on the best 100 composite 2-SNPs can find an optimal subset, and that further inclusion of more SNPs through a heuristic algorithm does not always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to avoid the nesting effect of forward selection, it does not always outperform the latter because of overfitting from observing more complex subset states.

Our results also indicate that HMSS, as a criterion for evaluating the classification ability of a function, can be used with imbalanced data without modifying the original dataset, unlike classification accuracy. Our four studies suggest that Sequential Information Bottleneck (sIB), a new unsupervised technique, can be adopted to predict the outcome, and that its ability to detect the target status is superior to that of traditional LDA in the study.

From our results we can see that the best test HMSS for predicting CVD, stroke, CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275, respectively, in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436, respectively, in the four studies if the test accuracy among controls is required to be at least 0.4.

A further genome-wide association study through chi-square tests shows that no significant SNPs are detected at the cut-off level 9.09451E-08 in the Framingham heart study of CVD. In the WTCCC study, only two significant SNPs associated with CAD were detected. In the genome-wide study of psoriasis, most of the top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through chi-square tests at the cut-off value 1.11E-07.

Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results (95% confidence intervals or statistical tests of differences) require more cost-effective methods or a more efficient computing system, neither of which is currently available for our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability, and that SNPs with good discriminant power are not necessarily causal markers for the disease.
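The contrast drawn above between HMSS and classification accuracy on imbalanced data can be illustrated with a few lines of arithmetic. The sketch below computes both from confusion-matrix counts; the counts and the function names are hypothetical and do not come from the study.

```python
def hmss(tp, fn, tn, fp):
    """Harmonic mean of sensitivity and specificity from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    if sensitivity + specificity == 0:
        return 0.0
    return 2 * sensitivity * specificity / (sensitivity + specificity)


def accuracy(tp, fn, tn, fp):
    """Fraction of all observations that were classified correctly."""
    return (tp + tn) / (tp + fn + tn + fp)


# Hypothetical imbalanced data: 10 cases and 990 controls, with a
# classifier that labels every observation as a control.
counts = dict(tp=0, fn=10, tn=990, fp=0)
print("accuracy:", accuracy(**counts))
print("HMSS:    ", hmss(**counts))
```

In this made-up example the classifier finds no cases at all, yet its accuracy is high because controls dominate; HMSS drops to zero because sensitivity is zero.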

Relevance:

80.00%

Publisher:

Abstract:

Dialysis patients are at high risk for hepatitis B infection, which is a serious but preventable disease. Prevention strategies include the administration of the hepatitis B vaccine. Dialysis patients have been noted to have a poor immune response to the vaccine and to lose immunity more rapidly. The long-term immunogenicity of the hepatitis B vaccine has not been well defined in pediatric dialysis patients, especially if administered during infancy as a routine childhood immunization.

Purpose. The aim of this study was to determine the median duration of hepatitis B immunity and to study the effect of vaccination timing and other cofactors on the duration of hepatitis B immunity in pediatric dialysis patients.

Methods. Duration of hepatitis B immunity was determined by Kaplan-Meier survival analysis. Comparison of stratified survival analysis was performed using log-rank analysis. Multivariate analysis by Cox regression was used to estimate hazard ratios for the effect of timing of vaccine administration and other covariates on the duration of hepatitis B immunity.

Results. A total of 193 patients (163 incident patients) had complete data available for analysis. Mean age was 11.2±5.8 years and mean ESRD duration was 59.3±97.8 months. Kaplan-Meier analysis showed that the median overall duration of immunity (since the time of the primary vaccine series) was 112.7 months (95% CI: 96.6, 124.4), whereas the median overall duration of immunity for incident patients was 106.3 months (95% CI: 93.93, 124.44). Incident patients had a median duration of hepatitis B immunity on dialysis of 37.1 months (95% CI: 24.16, 72.26). Multivariate adjusted analysis showed that there was a significant difference between patients based on the timing of hepatitis B vaccine administration (p<0.001). Patients immunized after the start of dialysis had a hazard ratio of 6.13 (2.87, 13.08) for loss of hepatitis B immunity compared to patients immunized as infants (p<0.001).

Conclusion. This study confirms that patients immunized after dialysis onset have an overall shorter duration of hepatitis B immunity as measured by hepatitis B antibody titers, and that, after the start of dialysis, protective antibody titer levels in pediatric dialysis patients wane rapidly compared with those of healthy children.
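The Kaplan-Meier estimate of median duration used in the methods above can be sketched in plain Python. This is a minimal product-limit estimator without confidence intervals; the follow-up times, the censoring flags and the function names are made up for illustration and are not the study's data or software.

```python
def kaplan_meier(durations, observed):
    """Return [(time, survival)] at each distinct event time.

    durations: follow-up time per patient; observed: True if the event
    (here, loss of immunity) occurred, False if the patient was censored.
    """
    event_times = sorted({t for t, e in zip(durations, observed) if e})
    survival, curve = 1.0, []
    for t in event_times:
        at_risk = sum(1 for d in durations if d >= t)
        events = sum(1 for d, e in zip(durations, observed) if e and d == t)
        survival *= 1.0 - events / at_risk
        curve.append((t, survival))
    return curve


def median_survival(curve):
    """First time at which estimated survival drops to 0.5 or below, else None."""
    for t, s in curve:
        if s <= 0.5:
            return t
    return None


# Made-up follow-up times in months; not data from the study.
months = [12, 24, 24, 36, 48, 60, 72, 84]
lost = [True, True, False, True, True, False, True, True]
curve = kaplan_meier(months, lost)
print(median_survival(curve))
```

Censored patients stay in the at-risk count until their last follow-up time, which is what distinguishes this estimate from simply taking the median of the observed loss times.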

Relevance:

80.00%

Publisher:

Abstract:

Purpose. A descriptive analysis of glioma patients by race was carried out in order to better elucidate potential differences between races in demographics, treatment, characteristics, prognosis and survival.

Patients and Methods. The study included 1,967 patients ≥ 18 years of age diagnosed with glioma and seen between July 2000 and September 2006 at The University of Texas M.D. Anderson Cancer Center (UTMDACC). Data were collated from the UTMDACC Patient History Database (PHDB) and the UTMDACC Tumor Registry Database (TRDB). Chi-square analysis, univariate and multivariate Cox proportional hazards modeling and survival analysis were used to analyze differences by race.

Results. Demographic, treatment and histologic differences exist between races. Though risk differences were seen between races, race was not found to be a significant predictor in multivariate regression analysis after accounting for age, surgery, chemotherapy, radiation, and tumor type as stratified by WHO tumor grade. Age was the most consistent predictor of risk of death. Overall survival by race was significantly different (p=0.0049) only in low-grade gliomas after adjustment for age, although survival differences were very slight.

Conclusion. Among this cohort of glioma patients, age was the strongest predictor of survival. It is likely that survival is influenced more by age, time to treatment, tumor grade and surgical expertise than by racial differences. However, age at diagnosis, gender ratios, histology and history of cancer differed significantly between races, and genetic differences to this effect cannot be excluded.

Relevance:

80.00%

Publisher:

Abstract:

Background. Injection drug users (IDUs) are at increased risk for HIV transmission due to unique risk behaviors, such as sharing needles. In Houston, IDUs account for 18% of all HIV/AIDS cases among Black males.

Objectives. This analysis compared demographic, behavioral, and psychosocial characteristics of needle-sharing and non-sharing IDUs in a population of Black males in Harris County, Texas.

Methods. Data used for this analysis were from the second IDU cycle of the National HIV Behavioral Surveillance System. This dataset included a sample of 288 Black male IDUs. Univariate and multivariate statistical analyses were performed to determine statistically significant associations of needle sharing in this population and to create a functional model to inform local HIV prevention programs.

Results. Half of the participants in this analysis shared needles in the past 12 months. Compared to non-sharers, sharers were more likely to be homeless (OR=3.70, p<0.01) or arrested in the past year (OR=2.31, p<0.01), inject cocaine (OR=2.07, p<0.01), report male-to-male sex in the past year (OR=6.97, p<0.01), and to exchange sex for money or drugs. Sharers were less likely than non-sharers to graduate high school (OR=0.36, p<0.01), earn $5,000 or more a year (OR=1.15, p=0.05), get needles from a medical source (OR=0.59, p=0.03), and ever test for HIV (OR=0.17, p<0.01). Sharers were more likely to report depressive symptoms (OR=3.49, p<0.01), lower scores on the family support scale (mean difference 0.41, p=0.01) and decision-making confidence scale (mean difference 0.38, p<0.01), and greater risk-taking (mean difference -0.49, p<0.01) than non-sharers. In a multivariable logistic regression, sharers were less likely to have graduated high school (OR=0.33, p<0.01) and to have been tested for HIV (OR=0.12, p<0.01), and were more likely to have been arrested in the past year (OR=2.3, p<0.01), get needles from a street source (OR=3.87, p<0.01), report male-to-male sex (OR=7.01, p<0.01), and have depressive symptoms (OR=2.36, p=0.02) and increased risk-taking (OR=1.78, p=0.01).

Conclusions. IDUs who shared needles are different from those who did not, reporting lower socioeconomic status, increased sexual and risk behaviors, increased depressive symptoms and increased risk-taking. These findings suggest that intervention programs that also address these demographic, behavioral, and psychosocial factors may be more successful in decreasing needle sharing among this population.

Relevance:

80.00%

Publisher:

Abstract:

An investigation was undertaken to determine the chemical characterization of inhalable particulate matter in the Houston area, with special emphasis on source identification and apportionment of outdoor and indoor atmospheric aerosols using multivariate statistical analyses.

Fine (<2.5 μm) particle aerosol samples were collected by means of dichotomous samplers at two fixed-site (Clear Lake and Sunnyside) ambient monitoring stations and one mobile monitoring van in the Houston area during June-October 1981 as part of the Houston Asthma Study. The mobile van allowed particulate sampling to take place both inside and outside of twelve homes.

The samples, collected over 12-h periods on a 7 AM-7 PM and 7 PM-7 AM (CDT) schedule, were analyzed for mass, trace elements, and two anions. Mass was determined gravimetrically. An energy-dispersive X-ray fluorescence (XRF) spectrometer was used for determination of elemental composition. Ion chromatography (IC) was used to determine sulfate and nitrate.

Average chemical compositions of fine aerosol at each site were presented. Sulfate was found to be the largest single component in the fine fraction mass, comprising approximately 30% of the fine mass outdoors and 12% indoors.

Principal components analysis (PCA) was applied to identify sources of aerosols and to assess the role of meteorological factors in the variation in particulate samples. The results suggested that meteorological parameters were not associated with sources of aerosol samples collected at these Houston sites.

Source factor contributions to fine mass were calculated using a combination of PCA and stepwise multivariate regression analysis. It was found that much of the total fine mass was apparently contributed by sulfate-related aerosols. The average contributions to the fine mass coming from the sulfate-related aerosols were 56% of the Houston outdoor ambient fine particulate matter and 26% of the indoor fine particulate matter.

The characterization of indoor aerosol in residential environments was compared with the results for outdoor aerosols. It was suggested that much of the indoor aerosol may be due to outdoor sources, but there may be important contributions from common indoor sources in the home environment such as smoking and gas cooking.

Relevance:

80.00%

Publisher:

Abstract:

Using a retrospective cross-sectional approach, this study quantitatively analyzed foodborne illness data, restaurant inspection data, and census-derived socioeconomic and demographic data within Harris County, Texas between 2005 and 2010. The main research question was the extent to which contextual and regulatory conditions distinguish outbreak and non-outbreak establishments within Harris County. Two groups of Harris County establishments were analyzed: outbreak and non-outbreak restaurants. STATA 11 was employed to determine the average profiles of each category across both the regulatory and socioeconomic (contextual) variables. Cross tabulations of all of the non-quantitative variables were also performed, and finally, a discriminant analysis was conducted to assess how well the variables were able to allocate the restaurants into their respective categories. Contextual and regulatory conditions were found to be minimally associated with the occurrence of foodborne outbreaks within Harris County. Across both categories (outbreak and non-outbreak establishments), the variables included were extremely similar in means and, where these could be observed, in distributions. The variables analyzed in this study, both regulatory and contextual, were not found to significantly allocate the establishments into their correct outbreak or non-outbreak categories. The implications of these findings are that regulatory processes and guidelines in place in Harris County do not effectively distinguish outbreak from non-outbreak restaurants. Additionally, no socioeconomic or racial/ethnic patterns are apparent in the incidence of foodborne disease in the county.

Relevance:

40.00%

Publisher:

Abstract:

The role of clinical chemistry has traditionally been to evaluate acutely ill or hospitalized patients. Traditional statistical methods have serious drawbacks in that they use univariate techniques. To demonstrate alternative methodology, a multivariate analysis of covariance model was developed and applied to the data from the Cooperative Study of Sickle Cell Disease (CSSCD).

The purpose of developing the model for the laboratory data from the CSSCD was to evaluate the comparability of the results from the different clinics. Several variables were incorporated into the model in order to control for possible differences among the clinics that might confound any real laboratory differences.

Differences for LDH, alkaline phosphatase and SGOT were identified which will necessitate adjustments by clinic whenever these data are used. In addition, aberrant clinic values for LDH, creatinine and BUN were also identified.

The use of any statistical technique, including multivariate analysis, without thoughtful consideration may lead to spurious conclusions that may not be corrected for some time, if ever. However, the advantages of multivariate analysis far outweigh its potential problems. If its use increases as it should, the applicability to the analysis of laboratory data in prospective patient monitoring, quality control programs, and interpretation of data from cooperative studies could well have a major impact on the health and well-being of a large number of individuals.